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ON  TESTING  TWO  THEORIES  REGARDING  THE  GENETIC  MAKEUP 
OP  PATIENTS  SUFFERING  FROM  UNIPOLAR  AFFECTIVE  DISORDER* 

Tim  Robertson  and  Giles  Warrack 

.»  f  4  *  '  >  \ 

SUMMARY 

In  an  article  in  the  Journal  of  Psychiatric  Research, 
-^Cadoret,  Woolson  and  Winokur  (1977)  consider^ two  theories 
regarding  the  genetic  makeup  of  patients  suffering  from  uni¬ 
polar  affective  disorder.'"'  Using  likelihood  ratio  techniques 
from  order  restricted  inference  we  analyze  their  data  and  com¬ 
pare  the  two  analyses. 


_  ii 

*This  research  was  sponsored  by  the  Office  of  Naval  Research 
under  Contract  N00014-80-C-0321 . 


INTRODUCTION .  In  an  article  in  the  Journal  of  Psychiatric 
Research,  Cadoret,  Woolson  and  Winokur  (1977)  consider  the 


genetic  factors  contributing  to  the  risk  of  unipolar  affective 

disorders  (u.a.d.).  They  consider  two  theories  accounting  for 

the  age  of  onset  of  u.a.d.:  ^1)  the  "qualitative"  theory,  which 

postulates  that  the  genetic  makeup  of  those  afflicted  early  In 

life  (under  40  years  old),  and  those  who  are  afflicted  later 

(over  40)  Is  somehow  different,  and  2)  the  "quantitative" 

theory  which  maintains  that  there  is  a  particular  type  of  gene 

causing  u.a.d.,  and  that  those  who  succumb  earlier  do  so  because 

r  ■>  ■/  o  * •  • 

they  possess  the  gene  in  greater  numbers  .irrc*-.' ordrr  >-<?7+r'  +*-»  h  nt  JV,  f. 

'■  At  T*  '  r  ,  l-_  1  ,  .J 

^0 i  hf  -i :  ‘>J  . 

In  their  investigation  Cadoret  et  al.  consider  samples  of 
767  women  and  398  men  suffering  from  u.a.d.  Each  sample  is 
divided  into  6  age  groupings  (3  less  than  40  and  3  greater  than 
40)  according  to  the  age  at  which  the  patient  first  suffered 
from  the  disorder.  For  each  age  group  they  have  obtained  (a)  the 
proportion  of  patients  having  alcoholic  fathers,  and  (b)  the  pro¬ 
portion  of  fathers  and  mothers  of  patients  within  the  group  who 
suffered  from  depression. 

These  data  reflect  on  the  genetic  makeup  of  the  families  in 
question  and  the  two  above-mentioned  theories  are  modeled  in 


terms  of  the  shapes  of  the  functions  relating  the  probability  of 
alcoholism  or  depression  in  parents  to  the  age  of  onset  of  the 
patients*  illness.  If  the  "qualitative"  theory  holds  then  the 
risk  function  should  be  constant  until  age  40,  drop  at  age  40, 
and  be  constant  thereafter.  If  the  "quantitative"  theory  holds 


2 


then  the  risk  function  should  simply  be  nonincreasing.  The 
shapes  of  the  risk  function  under  the  two  theories  are  shown  in 
Figure  1. 
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Figure  1.  Shapes  of  the  risk  function  under  the  two  theories 
explaining  the  age  of  onset  of  u.a.d. 


The  six  age  groupings  are  0-20  years,  20-29  years,  30-39 
years,  40-49  years,  50-59  years,  and  over  60  years.  Let  p., 
i  =1,2,  •••,6  be  the  respective  risks  for  these  age  groups 
(probability  of  an  alcoholic  father  is  one  case  and  the  probabil¬ 
ity  of  a  parent  suffering  from  depression  in  the  other  case). 

In  terms  of  statistical  hypotheses,  the  qualitative  theory  im¬ 
plies  that  Hq:  p^  *p^  *p^  =P^  ®Pg  while  the  quantitative 

theory  implies  that  HA:  p^  ipp  ^  •  ipg  •  Note  that  HQ  implies 

so  that  an  appealing  way  to  decide  between  the  two  theories 
would  be  to  use  the  data  to  test  HQ  as  a  null  hypothesis  when 
the  alternative  is  restricted  by  HA  (i.e.,  test  HQ  vs. 

HA  "H0  (HA  but  not  «0)). 


k. 


The  data  used  by  Cadoret  et  al.  (1977)  Is  contained  in 
Table  1.  In  analyzing  this  data  they  use  the  techniques  of 
order  restricted  inference  contained  in  Barlow,  Batholomew, 
Bremner  and  Brunk  (1972).  However,  distribution  theory  was  not 
available  for  testing  HQ  against  HA-HQ  and  Cadoret  et  al. 
tested  separately  the  hypotheses 

(a)  H^:  p1  =  p2  *  p3  vs.  -H'  where  H^:  p1  ^  p2  *  p3 

and 

(b)  Hj:  P4  =  P5  =  p6  vs.  H*-Hj  where  H ' :  P4  *  P5  *  pg . 

In  each  case  the  test  statistic  is  of  the  form  -2  In  A ,  where 

A  is  the  likelihood  ratio  and  the  asymptotic  distribution  is 
the  chi-bar-square  contained  in  Theorem  3.1  of  Barlow  et  al. 

Thus  for  each  set  of  data  they  computed  two  P-values.  These 
P-values  together  with  the  results  of  our  analysis  (described 
later)  are  given  in  Table  2. 

There  are  a  number  of  techniques  for  combining  P-values 

from  independent  tests.  One  of  the  more  appealing  techniques 

is  based  upon  the  facts  that  the  null  hypothesis  distribution 

of  a  P-value  is  uniformly  distributed  over  (0,1)  and  that 

-2  ln(U)  (U  is  distributed  uniformly  on  the  interval  (0,1)) 

has  a  chi-square  distribution  with  two  degrees  of  freedom.  Thus 

in  Table  2  we  have  combined  the  two  P-values  from  the  indepen- 

2 

dent  tests  by  computing  P[x^  *  -2 (In  p^  +  In  p2)],  where  p^ 
and  p^  are  the  P-values  obtained  for  the  two  separate  tests. 
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TABLE  1. 

Proportions  of  alcoholic  fathers  and 
parents  suffering  from  depression 

A.  FEMALES 

Age  at 

onset  of  illness 

Sample 

Size 

Proportion 
of  alcoholic 
fathers 

Proportion  of 
depressive  parents 

<  20  yrs. 

68 

.16 

.16 

20-29 

141 

.11 

.18 

30-39 

165 

.15 

.12 

40-49 

140 

.09 

.09 

50-59 

142 

.04 

.09 

60-69 

111 

.02 

.07 

B.  MALES 

Age  at 

onset  of  illness 

Sample 

Size 

Proportion 
of  alcoholic 
fathers 

Proportion  of 
depressive  parents 

<  20  yrs. 

33 

.09 

.21 

20-29 

82 

.08 

.10 

30-39 

67 

.13 

.15 

40-49 

83 

.05 

.15 

50-59 

58 

.03 

.09 

60-69 

70 

.04 

.10 

LIKELIHOOD  RATIO  ANALYSIS.  Consider  the  problem  of  testing  Hq 
against  the  alternative  -Hq.  In  order  to  be  specific,  we 
will  talk  about  the  fourth  set  of  data,  namely,  the  proportion 
of  depressive  parents  in  males  suffering  from  u.a.d.  (In  some 
ways  this  is  the  most  interesting  data  set.)  Let  p^ ;  i  =1,2, 

• • • ,6  denote  the  unrestricted  maximum  likelihood  estimates  of 
the  probabilities  P^,P2»*,'»Pg  (i.e.,  for  our  data  p^  =  .21, 

p>2  *  .  10  ,•••).  The  maximum  likelihood  estimates  subject  to  the 
restrictions  Hq  and  H^  can  be  found  using  any  one  of  several 
algorithms  in  Barlow  et  al.  (1972).  Perhaps  the  easiest  algo¬ 
rithm  is  the  pool  adjacent  violators  algorithm.  Let  p*;  i  =1, 

2,  •••,6  denote  the  maximum  likelihood  estimates  subject  to  the 
restriction,  H^.  Starting  with  the  unsmoothed  estimates  .21, 

. 10  , . 15 , . 15 » • 09  ,  • 10  the  values  of  p 2  and  p^  constitute  a 
violator  since  p^  <  p^.  They  are  both  replaced  by  their  weighted 
average,  namely,  (n2p2  + n^p^)/(n2+ n ^ )  =  ( 9+10 )/( 87+67 )  =  .12 

Ip  L 

(n^  is  the  number  of  probands  In  the  i—  group).  Note  that 

this  value  is  obtained  by  "pooling"  the  samples  from  the  2— 
rd 

and  3 —  groups.  We  now  consider  the  five  numbers,  .21,. 12, 

.15,. 09,.  10.  The  violators,  .12  and  .15,  are  replaced  by 
(n2P2+n3P3+  niJpi()/(n2+n-3+nil)  =  *13  and  the  violators  ,0Q  and 
.10  by  (n^p^+n^pg )/(n^+ng )  =  .09.  The  resulting  three  numbers 
are  decreasing  in  i.  Our  estimates,  restricted  by  H^,  are 
then  .21, .13, .13, .13, .09, .09. 

Let  p ^ ;  i  *1,2,  •••,6  denote  the  maximum  likelihood  esti¬ 
mates  subject  to  Hq.  These  estimates  are  obtained  by  pooling 
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the  data  corresponding  to  ages  less  than  40,  pooling  the  data 
corresponding  to  ages  over  40  and  then  pooling  the  resulting 
values  if  we  have  a  reversal.  For  the  male-depression  data, 
the  estimates  are  p^  =  .14;  i  =1,2,3  and  p^  =  .12;  i  =4,5,6. 
The  likelihood  ratio. 


A  = 


ini(p1)ni%i-pl)ni(1-£4) 

T^*7'Au-p;)ni<1-5l> 


is  then  computed  and  the  test  rejects  for  large  values  of  the 
statistic  T  =  -2  In  A.  For  this  data  the  value  of  T  is  8.51. 

The  authors  have  recently  been  able  to  derive  the  appropri¬ 
ate  limiting,  null  hypothesis  distribution  of  T.  The  test 
based  upon  T  is  not  similar  over  the  null  hypothesis,  H„. 
However,  one  can  show  that  if  n^  — »  <*>;  i  =1,2,  •••,6  in  such 
a  way  that  the  ratios  n^/n^  each  converge  to  some  strictly 
positive  number,  then 


sup  lim  P  [Tit]  =  L  ?Al  ,3]Pp[/?,3]P[X/  o*t] 

p€HQ  n^  -*o>  p  3*1^12*6  11  d  d  V*2"£ 

where  P  is  the  probability  computed  under  the  hypothesis  that 

P  2 

p  is  the  true  vector  of  probabilities,  denotes  a  standard 

chi-square  variable  with  i  degrees  of  freedom,  and 
represents  the  probability  that  the  estimates  p*,  p*  and  p* 
assume  distinct  values.  These  probabilities  may  be  com¬ 

puted  by  the  following  formulas  (see  Barlow  et  al.,  1972): 


•y 

I 


where 


pi(1>3)  -  *  -  &  sln_1  p 
P2(2,3)  =  | 

p1<3,3)  =  |  +  2^  sin-1  p, 

p  =  J _ ^3 _ 1  2. 

(n1+n2) (n2+n^) 


The  values  for  P2[^2,3]  are  computed  in  like  fashion.  It  is 
worth  noting  that  even  when  one  sample  size  is  3  or  ^  times  as 
large  as  another,  the  above  probabilities  remain  close  (i.e.,  to 
within  .01)  to  the  respective  probabilities  computed  under  the 
assumption  of  equal  sample  sizes,  these  being  P(l,3)  =  .3333, 
P(2,3)  =  .5,  and  P(3,3)  =  .1667.  The  limiting  distribution 
given  above  derives  its  form  from  the  fact  that  it  is 
the  probability  that  the  sum  of  two  independent  random  variables 
exceeds  the  value  t,  each  variable  being  distributed  according 
to  Bartholomew's  chi-bar-square  distribution  (see  Barlow  et  al., 
1972). 

Table  2  contains  the  value  of  the  likelihood  ratio  statis¬ 
tic  and  the  P-values  for  each  of  the  four  data  sets.  In  addi¬ 
tion,  the  P-values  for  separately  testing  H'  vs.  H^-H' 
and  HJ  vs.  H*-H'  are  given.  A  combined  P-value  has  been 
computed  using  the  facts  given  in  the  introduction. 


The  most  interesting  result  produced  by  this  analysis 


TABLE  2 .  P-values  for  testing  the  two  theories 


HC  vs.  Ha 

TEST 

H  •  vs  H  ' 
o  A 

H  "  vs  H  " 
o  A 

Combined 

p-value 

Male  Depression 

L-R  Stat 

3.51 

3.54 

3-10 

P-value 

0.015 

0.063 

0.073 

0.029 

Male  Alcoholism 

L-R  Stat 

0.11 

0.0 

0.11 

P-value 

0.78 

1.0 

0.518 

0.858 

Female  Depression 

L-R  Stat 

4.96 

4.17 

0.79 

P-value 

0.082 

0.043 

0.303 

0.069 

Female  Alcholism 

L-R  Stat 

7.94 

0.36 

7.60 

P-value 

0.019 

0 .288 

0-007 

0-014 
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concerns  male  patients  and  depressed  parents.  Neither  H'  nor 
H'  can  be  rejected  at  the  5%  level  of  significance.  However, 
the  P-value  for  the  likelihood  ratio  statistic  for  testing  Hq 
against  -Hq  is  .015.  It  is  also  interesting  to  note  that 
this  P-value  is  smaller  than  the  "combined"  P-value  implying 
stronger  evidence  against  Hq  than  against  =  p^s 

=  P5  =  P6- 
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